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Description of Accompanying CD-ROM (37 C.F.R. 1.52 & 1.58V 

Tables 1 A, IB, IC (also referred herein as Table 1), 2, 3 and 4 referred to herein are filed herewith 
on CD-ROM in accordance with 37 C.F.R. §§ 1.52 and 1.58. Two identical copies (marked "Copy 
1" and "Copy 2") of said CD-ROM, both of which contain Tables lA, IB, IC, 2, 3 and 4, are 
submitted herewith, for a total of two CD-ROM discs submitted. 

Table lA is recorded as "Table lA.txt" created July 12^2001, size 3,896KB. 

Table IB is recorded as "Table IB.txt" created on July 12, 2001, size 4,168KB. 

Table IC is recorded as "Table IC.txt" created on July 12, 2001, size 3,944KB. 

Table 2 is recorded as 'Table 2.txt" created on July 12, 2001, size 3,949KB, 

Table 3 is recorded as "Table 3.txt" created on July 12, 2001, size 3,947KB. 

Table 4 is recorded as "Table 4.txt" created on July 12, 2001, size 3,947KB. 

The contents of all files contained on the CD-ROM discs submitted with this appUcation are hereby 
incorporated by reference into the specification. 

Field of the Invention 

The present invention relates to the provision of a high resolution crystal structure of the prokaryotic 
30S ribosome subunit, particularly the 30S ribosome subunit bound to various antibiotics, and the 
use of this structure in drug discovery. 



Background of the Invention 



The wealth of information made available through efforts in structural genomics and advances in 
computation has allowed structure-based drug design to emerge as a valuable tool in medicinal 
5 chemistry. In the past combinatorial chemistry, coupled with high-throughput approaches, shifted 
attention away from the more structure-based methods. Large-scale determination of protein 
structures is reversing the drug discovery process by starting with the protein structure and using it to 
identify and design new Hgands. It is the integration of structure-based methods, virtual screening, 
and combinatorial chemistry that will provide the basis for more efficient drag design in the future, 

1 0 significantly reducing the time of the design cycle and the cost per marketed drug. Significant 
advances have already been made in AIDS, arthritis and cancer and in the treatment of hypertension 

iS (e.g. captopril). 

; Translation of the genetic code occurs on the ribosome, a large nucleoprotein complex that consists 
ti of two subunits. In bacteria, the two subunits are denoted SOS and SOS. The SOS subunit contains the 
py catalytic site of peptidyl transferase activity, while the SOS subunit plays a crucial role in decoding 

messenger RNA. Protein synthesis is a complex, multistep process that requires several extrinsic 
H GTP-hydrolysing protein factors during each of tiie main stages of initiation, elongation and 

1 y termination. Despite several decades of work, the molecular details of the process are poorly 

11 xmderstood, and the elucidation of the mechanism of translation is one of the fundamental problems 
in molecular biology today. A recent collection of articles summarizes the state of understanding of 
the field [1]. 

A contribution to this problem was made by Yonath and coworkers, who after nearly a decade of 

2 5 work showed that structures as large as the SOS ribosomal subunit would form crystals that diffract 

beyond 3 A resolution [2]. Originally, it was not clear that phase information from such a large 
asymmetric unit could be obtained to high resolution, but the development of bright, tunable 
synchrotron radiation sources, large and accurate area detectors, vastiy improved crystallographic 
computing, and the advent of cryocrystallography have all contributed to making structural studies 

3 0 of the ribosome more tractable. In our work, the use of anomalous scattering from the LIII edges of 

lanthanides and osmium has also played a critical role in obtaining phases. 
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The 308 ribosomal subunit (hereafter referred to as 308) from Thermus thermophilus was originally 
crystallized by Trakhanov et al in 2-methyl-2,4-pentanediol (MPD) [3] and soon afterwards by 
Yonath and coworkers in a mixture of ethyl-butanol and ethanol [4]. Subsequent work by both 
5 groups showed that the MPD crystal form diffracted to about 9-12A resolution [5, 6]. The diffraction 
limit of these crystals did not improve beyond 7A resolution for almost a decade, but more recently 
both Yonath and coworkers [7, 8] and we [9] obtained crystals of the MPD form that exhibit 
significantly improved diffraction. However, unlike the crystals obtained by the Yonath group [6], 
our crystals do not require soaking in tungsten clusters or heat treatment in order to obtain high 
1 0 resolution diffraction. 

0 We have previously described the structure of the 308 at 5.5A resolution [9]. We were able to place 
all seven proteins whose structures were known at the time, infer the structure of protein 820 to be a 
-^f. three-helix bundle, trace the fold of an entire domain of 168 RNA, and identify a long RNA helix at 
Ji the interface that contains the decoding site of the 308. Proteins 85 and 87 were also placed in 
Plj electron density m^s of the 308 obtained by Yonath and coworkers. 

^-1 The 308 ribosomal subunit is a major target for antibiotics. The ribosome is a useful target for 
j J antibiotics since the structure of the 308 is widely conserved between prokaryotes, allowing for 
2% broad spectrum antibiotics. However, resistance to current antibiotics is currently a major problem 
in the field of medicine. There are presently very few new antibiotics available which can be used to 
treat the highly resistant strains of bacteria such as MR8A (methicilin resistant Staphylococcus 
aureus) which are becoming increasingly widespread. 

2 5 Understanding the interaction of antibiotics with the ribosome at the molecular level is important for 

two reasons. Firstly, antibiotics act by interfering with various aspects of ribosome function. Thus 
understanding their interaction will help shed Ught on mechanisms involved in translation. Secondly, 
a detailed knowledge of antibiotic interactions with the ribosome could aid the development of new 
drugs against increasingly resistant strains of bacteria. Although antibiotics were characterized 

3 0 several decades ago, a detailed knowledge of their mechanism will in general require a three- 

dimensional structure of their complex with the ribosome. 
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The low (greater than 3 A) resolution crystal structures described above do not provide sufficiently 
detailed resolution for useful modelling of the crystal structure of the SOS and there is thus a need for 
a high resolution structure which can be useful in the development of novel therapeutics. 

5 

All references cited herein, including patent applications and publications, are incorporated by 
reference in their entirety. 

Summarv of the Invention 

10 

We have now solved and refined the structure of the 30S bound to a number of antibiotics at 3 A 
□ resolution. The structure contains all of the ordered regions of 16S RNA and 20 associated proteins, 
J and contains over 99% of the RNA sequence and 95% of the protein sequences, with the missing 

0 parts being exchisively at the termini of RNA or polypeptide chains. 

m refined atomic resolution models of the 3 OS presented here allows the interpretation of a vast 

amount of biochemical data on its function in precise structural terms. The structure Avill also serve 
SJ as a basis for the interpretation in molecular terms of lower resohition models of various functional 
Q states by electron-microscopy or X-ray crystallography. The 30S structure will help produce testable 
115 models for various aspects of ribosome function. 

1 SOT. 

hi a first aspect, the present invention provides a crystal of the Thermus thermophilus 30S subunit 
bound to an antibiotic Z (where Z is defined below), having a tetragonal space group P4i2i2 with 
unit cell dimensions, for each of the antibiotics Z, of about: 



z 


a( Angstroms) 


b(Angstroms) 


c( Angstroms) 


Paromomycin 
(Tables lA/B) 


401.375 


401.375 


175.887 


Paromomycin 


401.2 


401.2 


176.4 


Streptomycin 


401.375 


401.375 


175.887 


Spectinomycin 


401.375 


401.375 


175.887 
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Tetracycline 


401.158 


401.158 


176.944 


Pactamycin 


401.719 


401.719 


177.002 


Hygromycin B 


402.063 


402.063 


175.263 



An advantageous feature of these structures are that they diffract beyond 3 A resolution. Another 
feature of the structures is that they were obtained in a method which did not involve soaking 
crystals in heavy atom (e.g. tungsten or tantalum) clusters or heat activation. Furthermore, it is 
5 specifically of the 885-888/910-912 base pairing confirmation of 16S RNA. These features, both 
singly and in combination all contribute to features of the invention which are advantageous. 

In a second aspect, the invention also provides a crystal of 308 having the three dimensional atomic 
a coordinates of Table n, v^here Table n is any one of Tables 1 to 4. The binding of the antibiotics 
ij) paromomycin, spectinomycin and streptomycin are shovra in the coordinates of Table 1 A. More 
i'S refined coordinates are shown in Table IB, Thus, in some embodiments, reference herein to "Table 

1 y 1 " is a reference to either of Table 1 A or IB (or where the context permits, both; i.e., reference to 

ry "Table 1" can refer to Table 1 A and^^ Table IB). Fiirther, Table IC represents a further data set for 
1^ the antibiotic paromomycin bound to the 30S ribosome, and thus where the antibiotic Z is 
±4 paromomycin, reference to Table 1 (or Table n where 1 is included) means any one of 1 A, IB or IC 
Ly or any combination of Tables 1 A, IB, and/or IC, i.e.. Table 1 A and/or Table IB and/or Table IC, 
preferably Table 1 A or Table IB or Table IC. 

Thus, for example, where it is stated that the invention refers to computer readable media with 

2 0 "atomic coordinate data according to Table 1 recorded thereon", this means that the media has either 

the data of Table lA, or the data of Table IB (and in the case of paromomycin, or Table IC) or both 
(or in the case of paromomycin, any two or all three of said tables), recorded thereon. Likewise, 
reference to Table n where n is 1, also means either or both of 1 A and/or IB (and in the case of 
paromomycin, and/or IC). 

25 

Equally, it will be understood that Table 1 A or Table IB may be provided with the data derived from 
the 30S ribosome with only a single antibiotic data set provided, and such a table forms a part of the 
invention. 
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Table 1 provides the coordinates of the 30S ribosome bound to the antibiotics paromomycin, 
spectinomycin and streptomycin (1 A and IB), and paromomycin alone (IC), as described above. 
Table 2 provides the coordinates of the 308 ribosome bound to tetracycline (primary and secondary 
5 sites). Table 3 provides the coordinates of the SOS ribosome bound to pactamycin. Table 4 provides 
the coordinates of the 30S ribosome bound to hygromycin B. 

We have also observed that 30S crystals do not contain the SI subunit protein, hi our studies, we 
have found that by selectively removing this protein prior to crystallization, we have been able to 

1 0 obtain the improved resolution described herein. Although the atomic co-ordinates provided in 

Table n below allows those of skill in the art to bypass the need to undertake the crystalUzation of 

□i the 30S, this crystalUzation method nonetheless fDrms a further aspect of the invention. 

Accordingly, there is provided a method for crystallizing a 30S subunit to obtain a high resolution 
IS structure of a SOS subxmit, which method comprises providing a SOS subunit, selectively removing 
ry the SI subunit thereJBrom, crystalKzing the SOS and soaking into the crystals antibiotic X. 

y In a further aspect, the present invention provides a method for identifying a potential inhibitor of 
i „ I the SOS comprising the steps of: 

29 a. employing a three-dimensional structure of SOS, or at least one sub-domain thereof, to 

characterize at least a site bound by antibiotic X, the three-dimensional structure being defined by 
atomic coordinate data according to any one of Tables 1 to 4; and 

b. identifying the potential inhibitor by designing or selecting a compound for interaction with the 
active site. 

25 

Li a further aspect, the present invention provides computer readable media with either (a) atomic 
coordinate data according to Table n recorded thereon, said data defining the three-dimensional 
structure of SOS or at least one sub-domain thereof, or (b) structure factor data for SOS recorded 
thereon, the structure factor data being derivable firom the atomic coordinate data of Tables n. 

30 

Description of the Drawings. 
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Figure 1 shows the binding of paromomycin to the SOS ribosome. 

Figure 2 shows the binding of spectinomycin to the 308 ribosome. 

Figure 3 shows the binding of streptomycin to the 30S ribosome. 

Figure 4A shows the binding of tetracycKne (primary site) to the 30S ribosome. 

Figure 4B shows the binding of tetracychne (secondary site) to the 30S ribosome. 

Figure 5 shows the binding of pactamycin to the 30S ribosome. 

Figure 6 shows the binding of hygromycin B to the 30S ribosome. 

Description of Tables on Accompanying CD-ROM (37 C.F.R. 1.52 & 1.58V 

Tables 1 A, IB, IC, 2, 3 and 4 referred to herein are filed herewith on CD-ROM in accordance with 
37C.F.R. §§ 1.52andl.58. Two identical copies (marked "Copy 1" and "Copy 2") of said CD- 
ROM, both of which contain Tables 1 A, IB, IC, 2, 3 and 4, are submitted herewith, for a total of 
two CD-ROM discs submitted. The contents of all files contained on the CD-ROM discs submitted 
with this appHcation are hereby incorporated by reference into the specification. 

Table lA is recorded as "Table 1 A.txt" created July 12, 2001, size 3,896KB, and provides the 
coordinates of the 30S ribosome boimd to the antibiotics paromomycin, spectinomycin and 
streptomycin. 

Table IB is recorded as "Table IB.txt" created on July 12, 2001, size 4,168KB, and provides the 
coordinates of the 30S ribosome bound to the antibiotics paromomycin, spectinomycin and 
streptomycin. 

Table IC is recorded as "Table IC.txt" created on July 12, 2001, size 3,944KB, and provides the 
coordinates of the 3 OS ribosome bound to the antibiotic paromomycin alone. 

Table 2 is recorded as "Table 2.txt" created on July 12, 2001, size 3,949KB, and provides the 
coordinates of the 30S ribosome bound to tetracycline (primary and secondary sites). 

Table 3 is recorded as "Table 3.txt" created on July 12, 2001, size 3,947KB, and provides the 



coordinates of the 30S ribosome bound to pactamycin. 

Table 4 is recorded as "Table 4.txt" created on July 12, 2001, size 3,947KB, and provides the 
coordinates of the BOS ribosome bound to hygromycin B. 

5 

Detailed Description of the Invention. 
Definitions, 

"A", "an", "the" and the Kke, unless otherwise indicated include plural forms. 

1 0 The term "sub-domain" includes any one or more of the following: 

(a) an element selected from the following: 

□ at least one complete element of secondary structure, i.e. an alpha hehx or a beta sheet, or RNA 
.'S helix, as described in the detailed description below; 

a group of two or more such elements which interact with each other; 
IB at least one subunit protein; 

a subgroup of subunit proteins, for example a group which includes two or more proteins which are 

found to interact with each other; 
SI any of the above, when the protein(s) or element(s) thereof is used in conjunction with all or part of 

1 ,| the 16S RNA structure associated with said element(s) or protein(s); 

(b) a space of volume defining a region around any one particular atom of interest (e.g. an atom 
involved in binding to an antibiotic), the volume being less than the total volume of the tetragonal 
space of the complete crystal. For example, the coordinates of atoms in a volume of fi-om about 500 
to about 15,0OOA^ may be selected and used for the present invention. Such a space may be a sphere 

2 5 having a diameter of fi-om about lOA to about 30A, centred around a point of interest; and 

(c) a collection of at least about 10, e.g. at least about 25 such as at least about 50, more preferably at 
least about 100, even more preferably at least about 500 atoms and most preferably at least about 
1000 atoms defined by the coordinates of Table n, wherein at least 2 of said atoms, and preferably at 

3 0 least about 50% of said atoms of the collection are located within about 50A of each other. 
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By "fitting", is meant determining, by automatic or semi-automatic means, interactions between one 
or more atoms of an potential inhibitor molecule and one or more atoms or binding sites of the 30S, 
and calculating the extent to which such interactions are stable. Various computer-based methods 
for fitting are described fiirther herein. 

By "root mean square deviation" we mean the square root of the arithmetic mean of the squares of 
the deviations firom the mean. 

An "active site" of the SOS is any part of this structure involved in binding to antibiotic Z, or 
region(s) of the structure whose conformation is altered by the binding of Z. It also includes any 
part of this structure involved in tRNA or mRNA binding, synthesis or translocation, including 
regions of the complex not directly associated with tRNA or mRNA binding but which are required 
for the ribosome to function, for example those regions which undergo structural changes associated 
with protem synthesis or are target sites for regulation by co-factors, phosphorylation or acetylation. 

Particular regions of the SOS include those identified herein as antibiotic binding regions based on 
the data provided in Tables n (1-4), and in the Figures 1-6. Regions fiu^her include the three tRNA 
sites, i.e. the aminoacyl (A), peptidyl (P) and exit (E) sites. Other active sites are those which 
undergo movement during translocation of tRNAs fi*om the A to P sites and the P to E sites. 

"Computer readable media" refers to any media which can be read and accessed directly by a 
computer. Such media include, but are not limited to: magnetic storage media such as floppy discs, 
hard disc storage medium and magnetic tape; optical storage media such as optical discs or CD- 
ROM; electrical storage media such as RAM and ROM; and hybrids of these categories such as 
magnetic/optical storage media. 

A "computer system" refers to the hardware means, software means and data storage means used to 
analyse the atomic coordinate data of the present invention. The minimum hardware means of the 
computer-based systems of the present invention comprises a central processing unit (CPU), input 
means, output means and data storage means. Desirably a monitor is provided to visuahse structure 
data. The data storage means may be RAM or means for accessing computer readable media of the 



invention. Examples of such systems are microcomputer workstations available from Silicon 
Graphics Incorporated and Sun Microsystems running Unix based, Windows NT or IBM OS/2 
operating systems. 

5 A "ligand" is any chemical moiety (organic or inorganic) that binds or interacts, generally but not 
necessarily specifically, to or with another chemical entity. 

Table n. 

The coordinates of Tables 1 to 4 provide a measure of atomic location in Angstroms, to a third 
1 0 decimal place. In order to use the information in these Tables for the purposes described herein as 

being aspects of the present invention, these coordinates may be varied by up to about + LO, such as 
Q by up to about ± 0.7, preferably no more than up to about ±0.5 Angstroms, without departing from 

the scope of the invention. 

m Furthermore, varying the relative atomic positions of the atoms of the structure so that the root mean 
P|l square deviation of the 16S RNA or S2-S20 protein backbone atoms is less than about l.SA 

(preferably less than about 1 .OA and more preferably less than about 0.5 A) when superimposed on 
^4 the coordinates provided in Table n for these structures, will generally result in a structure which is 

1 ' j substantially the same as the structure of Table n respectively in terms of both its structural 
M characteristics and potency for structure-based drug design of SOS ligands. 

Thus for the purposes described herein as being aspects of the present invention, it is within the 
scope of the invention if: the coordinates of Table n are transposed to a different origin and/or axes; 
the relative atomic positions of the atoms of the structure are varied so that the root mean square 

2 5 deviation of conserved residue backbone atoms is less than about 1 .5 A (preferably less than about 

l.OA and more preferably less than about 0.5A) when superimposed on the coordinates provided in 
Table n for the conserved residue backbone atoms; and/or the number and/or positions of water 
molecules is varied. Reference herein to the use of the coordinates of Table n thus includes the use 
of coordinates in which one or more individual values of the Table are varied in this way. 

30 

Table n includes coordinates of metal ions which are selected from zinc, cobalt and magnesium. 
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Some or all of these ions may optionally be discarded from the Tables when using the data. The 
Tables also list the coordinates of a 26 amino acid peptide, Thx, as well as a 6 nucleotide fragment 
of mRNA, NNNUCU, designated as molecule X. Both the coordinates of both these molecules may 
likewise optionally be discarded, i.e. so that the coordinates of the 16S RNA and the proteins S2 to 
5 S20 alone are modeled and used in appUcations of the invention. 

There are a few N- or C-terminal sequences of the S2 to S20 proteins which were not resolved in the 
structure of Table n, together with some of the 5' and 3' residues of the 16S RNA. These are not 
essential for the purposes of the present invention. 

10 

This invention provides those of skill in the art a means to provide SOS crystals of TJhermophilus, 
□ The conservation of ribosome structure, particularly regions of structure essential for function, 

between prokaryotes, for example prokaryotes which are human pathogens, such as Staphylococcus 

spp, and the like, allows the structure herein to be useful in the provision of anti-bacterial agents in 
TB general. Thus, the structures may be used to solve 308 subunits by the technique of molecular 
I y replacement. In such a method, x-ray diffraction data are obtained from crystals of a 30S subunit 

from another species, e.g. a species of a bacteria pathogenic to humans. The coordinates of Table n 
'^4 may be used to find the orientation of the unknown molecule in the crystal, and electron density 
; J maps calculated. These maps can then be interpreted with the sequence of the species in question, 
M and the coordinates of our 30S structure can be used to help and speed interpretation. In this way, the 

structure of our 30S facilitates the determination of structures of 308 subunits and whole ribosomes 

from other organisms. 



Accordingly, the invention provides a method for the determination of the structure of a bacterial 
2 5 308 from a species other than T. thermophilus which method comprises: 

(a) crystalHsing the 308 of said species to obtain a crystal; 

(b) performing X-ray crystallography on said crystal to obtain X-ray diffraction data; 

(c) providing the structure data of Table n; and 

(d) using molecular replacement to calculate an electron density map of the 308. 

30 

The crystallisation step (a) is optionally performed with an antibiotic Z, either in a co-crystallisation 
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or by soaking the antibiotic following crystal formation. Thus the calculated electron density map 
may be that of the 30S - antibiotic complex. 

In such a method the 308 may be prepared by removal of the SI subunit, as described herein. 

5 

The electron density map obtained may then be used to calculate the atomic coordinate data of the 
30S, optionally with bound antibiotic Z. The atomic coordinate data thus obtained may be used to for 
the design and analysis of new and specific ligands for 30S as described herein. 

1 0 The SOS crystal structure. 

The high resolution structure provided herein provides a crystal with unit cell dimensions which are 
O provided in the accompanying table to 3 decimal places, as set out above. However, those of skill in 

the art wishing to reproduce the crystallization described herein and obtain such crystals will 
J^fj appreciate that a degree of experimental variability and error will mean that crystals of the invention 
W will be obtained with a unit cell dimension within, but not exactly corresponding to, this size. Thus 

crystals of the invention may generally be defined as having unit cell dimensions a, b, and c as 
;L defined above which vary in the case of a by ± about 4.0A, b by ± about 4.0A and c by ± about 
'^J 5.0A, preferably a by + about 1 .OA, b by ± about 1 .OA and c by + about 2.0A. More preferably the 
JTJ variance is no more than a ± about 0.7A, b ± about 0.7A and c ± about 1 .4A, and even more 

11 preferably no more than a ± about 0.2A, b ± about 0.2A and c ± about 0.4A. These unit cell sizes 
are believed to define novel and more highly resolved unit cell sizes than has previously been 
possible in the art. 

Production of crystals. 

25 To obtain crystals according to the present invention, we have foimd that selective removal of the S 1 
subunit protein is advantageous. This may be achieved by the use of hydrophobic interaction 
chromatography column (poros-ET). 30S ribosomal subunits lacking the SI subunit may suitably be 
separated fi*om those containing the SI subunit by running a colxmm using a reverse ammonium 
sulfate gradient from 1.5M to 0.5M, with 20mM Hepes, pH 7.5, and lOmM acetate. The 30S 

3 0 subunits lacking SI are eluted first, giving the first major peak. During elution of the 30S peak the 
ammonium sulfate concentration is maintained at a constant level. Once the 30S peak has eluted the 
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ammonium sulfate concentration is then further reduced to elute the 30S + SI jfraction. 

An alternative method for the selective removal of the SI subunit protein is by preparative sepharose 
or by gel electrophoresis. Gel electrophoresis may suitably be carried out by first preparing and 
mixing a 3% acrylamide, 0.5% agarose cylindrical gel, and pouring this gel into a BioRad Prep Cell. 
BOS ribosomal submits are then loaded onto the gel and continuously eluted as they emerge form the 
other end of the gel. The 30S fraction lacking the SI subunit comes off first, giving the first major 
peak. The SOS + SI fraction gives the trailing peak (or shoulder) and can be discarded. 

Once the SI is removed, the crystals may be formed, using suitable conditions. These include the 
use of 13-17% v/v methyl-2,4-pentanediol in the presence of 200-300 (e.g. about 250) mM KCl, 50- 
100 (e.g. about 75) mM ammonium chloride, 15-30 (e.g. about 15 or about 25) mM MgCl2 at a pH 
of 6.0 - 7.5 (e.g about pH 6.3 - 6.7 such as pH 6.5) in 50 - 150 (e.g. about 100) mM sodium or 
potassium cacodylate or MES (2-(N-morpholino)ethane sulphonic acid). 

In a particular aspect, the conditions may comprise the use of 250 mM KCl, 75 mM NH4CI, 25 mM 
MgCl2, 6 mM 2-mercaptoethanol in 0.1 M potassium cacodylate or 0.1 M MES (2-N-morpholmo- 
ethanesulfonic acid) at pH 6.5 with 13-17% MPD as the precipitant. 

The crystals may be grown by any suitable method known as such to those of skill in the art. 
Suitably, the crystals may be grown over a period of 4-8 weeks at about Once crystals are 
obtained, the antibiotic Z may be soaked into the crystals. The antibiotic may be used in any 
convenient soluble form at a concentration range of from about 10 to about 500 /xM, preferably from 
about 50 to about 100 /iM, such as about 80 fiM, The structure of the crystals so obtained may be 
resolved, and crystals which resolve to a resolution of at least about 3A selected. Crystals which 
resolve to a resolution of at least about 3 A obtainable by such a method are a further aspect of the 
invention. 

Uses of the structural data of Table n. 

The determination of the three-dimensional structure of 30S provides a basis for the design of new 
and specific ligands for 30S. For example, knowing the three-dimensional structure of 30S, 
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computer modelling programs may be used to design different molecules expected to interact with 
possible or confirmed active sites, such as binding sites or other structural or functional features of 
SOS. 

The high resolution models of the 308 provided by Table n can be used to examine and determine 
the binding of the antibiotics paromomycin, streptomycin, spectinomycin, tetracycline, pactamycin 
and hygromycin B to the 30S ribosome, and by using this information, the skilled person in the art 
can design hgand which may compete with these antibiotics and which can overcome the resistance 
of bacterial cells to these antibiotics. 

A candidate Hgand, particularly but not necessarily one which acts as an inhibitor molecule may be 
any available compound. A nimiber of commercial sources of libraries of compound structures are 
available, for example the Cambridge Structural Database. Such libraries may be used to allow 
computer-based high throughput screening of many compounds in order to identify those with 
potential to interact with the active site of a ribosome. 

More specifically, a potential ligand capable of modulating 30S activity can be examined through 
the use of computer modelling using a docking program such as GRAM, DOCK, or AUTODOCK 
(see Walters et al., Drug Discovery Today, Vol.3, No.4, (1998), 160-178, and Dunbrack et aL, 
Folding and Design, 2, (1997), 27-42) to identify potential Ugands of 30S, This procedure can 
include computer fitting of potential ligands to 30S to ascertain how well the shape and the chemical 
structure of the potential hgand will bind to the enzyme. 

Also computer-assisted, manual examination of the active site structure of 30S may be performed. 
The use of programs such as GRID (Goodford, 1 Med, Chem., 28, (1985), 849-857) - a program that 
determines probable interaction sites between molecules with various fimctional groups and the 
enzyme surface - may also be used to analyse the active site to predict partial structures of Ugands 
for the site. 

Computer programs can be employed to estimate the attraction, repulsion, and steric hindrance of the 
two binding partners (e.g. the 30S and a potential ligand). Generally the tighter the fit, the fewer the 
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steric hindrances, and the greater the attractive forces, the more potent the potential ligand since 
these properties are consistent with a tighter binding constant. Furthermore, the more specificity in 
the design of a potential ligand, the more likely it is that the ligand will not interact with other 
proteins as well. This will tend to minimise potential side-effects due to unwanted interactions with 
5 other proteins. 

In one aspect, the above described methods may be used to perform a computer-based method of 
rational drug design which comprises: 

providing the structure of the 308 ribosome defined by the coordinates found in Table n; 
1 0 providing the structure of a candidate inhibitor molecule; 

fitting the candidate to the structure of the 308 to provide a result; and 
Q comparing the result with a structure comprising the 308 of the Table together with the 

IIJ antibiotic coordinate data of said Table. 

m In the case where Table n is Table 2 (tetracycline), the comparison may be with one or other, or 
Pi I both, bound tetracycline molecules. 

SJ The 308 ribosome used in the present invention comprises an additional small protein molecule, 
I Thx, as well as a short sequence of nucleotides designated molecule X. It will be understood that the 

is phrase "the structure of the 308 ribosome as defined by the coordinates of Table n" and the like 
(where n is any one of 1 to 4) as used above and elsewhere herein is reference to the coordinates 
defined by atoms of the 168 RNA and proteins 82 to 820 of the Table n, including or not including 
the Thx and molecule X coordinates, optionally in conjunction with any or all of the metal ions 
defined by Table n. 

25 

The data of Table n indicate that the primary contacts between antibiotic Z and the 308 are mediated 
by the 168 RNA. Thus in the above aspect of the invention, those of skill in the art may choose to 
use the data of Table n relating to the 168 RNA and one of the antibiotics Z in the process of drug 
design. Accordingly, there is also provided a computer-based method of rational drug design which 
30 comprises: 

providing the structure of the 168 RNA of the 308 ribosome as defined by the coordinates of 
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Table n; 

providing the structure of a candidate inhibitor molecule; 

fitting the structure of candidate to the structure of the 16S RNA of the 308 to provide a 
result; and 

comparing said result with a structure comprising the 16S RNA of the 308 of said Table 
together with the antibiotic structure of said Table. 

In an alternative aspect, the method of the invention may utihse the coordinates of atoms of interest 
of the 308 which are in the vicinity of an antibiotic Z binding region in order to model the pocket in 
which Z binds. These coordinates may be used to define a space which is then screened "m silico " 
against a candidate inhibitor molecule. Thus the invention provides a computer-based method of 
rational drug design which comprises: 

providing the coordinates of at least one atom of Table n of the 308 ribosome; 
providing the structure of a candidate inhibitor molecule; 

fitting the structure of candidate to the coordinates of the 308 ribosome provided to obtain a 
result; and 

comparing said result with a structure comprising the coordinates of the 308 ribosome 
provided and at least one atom firom one antibiotic structure of Table n. 

In this embodiment, the at least one atom of the 308 ribosome provided will preferably be within a 
distance of about 50, preferably about 10 Angstroms of at least one of the atoms of any of the 
antibiotic molecule described in Table n. 

In practice, it will be desirable to model a sufficient number of atoms of the 308 ribosome as defined 
by the coordinates of Table n which represent a binding pocket. Binding pockets and other features 
of the interaction of antibiotic Z with the 308 ribosome are described in the accompanying examples. 
Thus, in this embodiment of the invention, there will preferably be provided the coordinates of at 
least 5, preferably at least 10, more preferably at least 50 and even more preferably at least 100 
atoms such as at least 500 atoms and most preferably at least 1,000 atoms of the 308 ribosome. Of 
these atoms provided, at least one will preferably be within the distance mentioned above of the 
antibiotic molecules described in Table n. 



Likewise, when a candidate is fitted to the selected coordinates of the 30S ribosome the comparison 
with antibiotic is preferably made by reference to at least 3, such as at least 5, for example at least 8, 
more preferably at least 16 of the atoms of any of the antibiotic Z structures provided in Table n. 

Li another aspect, the method of the invention may utilise a sub-domain of interest of the SOS which 

is in the vicinity of a antibiotic binding region. Thus, the invention provides a computer-based 

method of rational drug design which comprises: 

providing the coordinates of at least a sub-domain of the SOS ribosome; 

providing the structure of a candidate inhibitor molecule; 

fitting the structure of the candidate to the coordinates of the SOS ribosome sub-domain provided to 
obtain a result; and 

comparing said result with a structure comprising the coordinates of the SOS ribosome of same sub- 
domain provided and at least one atom from the antibiotic Z structure of Table n. 

In a further aspect, the accompanying examples and drawings show the specific sites of interaction 
of the antibiotic Z with the SOS ribosome. These data may be used to design ligands which interact 
with at least one of the sites of interaction of each identified antibiotic Z, and preferably at least 
about 50% of the sites of interaction identified for each separate antibiotic Z in each of Figures 1 to 
6. Such ligands may be designed by providing atomic coordinate data for at least one of the 
following nucleic acid or amino acid residues of the SOS: 

Group I: G1405, AMOS, C1490, G1491, A149S, G1494 andU1495; 
Group II: G1064, C1066, G1068 and CI 192; 

Group III: U14, C526, G527, A91S, A914, C1490, G1941 and S12Lys45; 
Group IV: A965, G966, G105S, C1054, CI 195, U1196, G1197 and G1198; 
Group V: U244, A892 and C89S; 

Group VI: G693, A694, C788, C795, C796, S7Gly81 and optionally U 1540; 
Group VII: C140S, G1405, G1494, U1495, C1496 and U1498, 

providing a potential ligand, and 



fitting said ligand to the 308 to determine the interaction of the ligand with at least one chemical 
group present in the nucleic acid or amino acid residue of the selected group. 

Preferably at least half the members of each group are used, and more preferably firom half to t 
members of each group are used, where t represents a number which is more than half and at least T, 
preferably T-1 and more preferably T-2 where T is the total number of members of each group, 
subject to the requirement that t is greater than T/2 (i.e. for group II 2, 3 or 4 members may all be 
used, and for group V, 2 or 3 members may be used). 

In another aspect, in place of m silico methods, high throughput screening of compounds to select 
compounds with ribosome binding activity may be undertaken, and those compounds which show 
ribosome binding activity may be selected as possible candidate inhibitors, and fiirther crystallized 
with 30S (e.g. by co-crystallization or by soaking) for x-ray analysis. The resultmg x-ray structure 
may be compared with that of Table n for a variety of purposes. For example, where the contacts 
made by such compounds overlap with those may by antibiotic Z, novel molecules comprising 
residues which contain contacts of both Z and the other inhibitor may be provided. 

Having designed or selected possible binding ligands, these can then be screened for activity. 
Consequently, the method preferably further comprises the further steps of: 
obtaining or synthesising the potential ligand; and 

contacting the potential ligand with 30S to determine the ability of the potential ligand to interact 
with 30S. 

More preferably, in the latter step the potential Ugand is contacted with 30S under conditions to 
determine its function, for example in a cell free translation system. Such conditions (including cell 
free translation systems) are known in the art. 

Instead of, or in addition to, performing such an assay, the method may comprise the further steps of: 
obtaining or synthesising said potential hgand; 
forming a complex of 30S and said potential ligand; and 

analysing said complex by X-ray crystallography to determine the ability of said potential ligand to 
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interact with 30S. 



Detailed structural information can then be obtained about the binding of the potential ligand to 30S, 
and in the light of this information adjustments can be made to the structure or functionality of the 
potential ligand, e.g. to improve binding to the active site. These steps may be repeated and re- 
repeated as necessary. 

Another aspect of the invention includes a compound which is identified as an hgand of 308 by the 
method of the above aspects of the invention. 

In another aspect, the invention provides a method of analysing a 30S-Hgand complex wherein the 
ligand has been obtained by the methods of the invention described above, comprising the steps of 
(i) cocrystallising the 308 with the ligand or soaking the ligand into crystals of the 308; (ii) 
collecting X-ray crystallographic diffraction data from the crystals of the 30S-ligand complex and 
(iii) using the three-dimensional structure of 308 of Table n, or at least one sub-domain thereof, to 
generate a difference Fourier electron density map of the 30S-ligand; and (iv) modelling the Ugand 
in the difference Fourier electron density. 

Therefore, 308-ligand complexes can be crystaUised and analysed using X-ray diffraction methods, 
e.g. according to the approach described by Greer et al., J, of Medicinal Chemistry, Vol. 37, (1994), 
1035-1054, and difference Fourier electron density maps can be calculated based on X-ray 
diffraction pattems of soaked or co-crystaUised 308 and the solved structure of uncomplexed 308. 
These maps can then be used to determine the structure of the ligand bound to the 308 and/or 
changes the conformation of 308. 

Data obtained from a ligand bound to 308 may be used to improve the ligand, for example by adding 
or removing fimctional groups, substituting groups or altering its shape to obtain improved 
candidates, which may then be screened, solved in complex as described herein above, in an iterative 
process. 

Electron density maps can be calculated using programs such as those from the CCP4 computing 



package (Collaborative Computational Project 4. The CCP4 Suite: Programs for Protein 
Crystallography, Acta Crystallographica, D50, (1994), 760-763.). For map visuaUsation and model 
building programs such as "O" (Jones et aL, Acta Crystallograhy, A47, (1991), 1 10-1 19) can be 
used. 

The high resolution data provided herein allows those of skill in the art who have obtained structures 
of worse resolution of the 308 to refine such structures in the Ught of the data of Table n. Thus in a 
further aspect, the invention provides a method for modelling a structure of a 30S ribosome which 
comprises providing an atomic model of a structure at a resolution of worse than about 3A (e.g. a 
resolution of worse than about 5 Angstroms, such as about 5-12 A), comparing the structure 
obtained with the data of Table n, and refining said model obtained to resolve the structure in order 
to provide a higher resolution structure. Such a process will be useful for the refinement of a 30S 
itself, or the 3 OS in various functional states as part of the 70S ribosome (e.g. bound to mRNA, 
elongation factors or the like). 

Such a method will be useful in providing the structure of the 30S ribosome fi-om other bacterial 
sources, since the overall secondary and tertiary structure of such ribosomes will be highly 
conserved in comparison to the T. thermophilus structure provided herein. The data provided herein 
may be used in a process of modelling the 30S of other species ab initio by homology modeUing 
using energy minimization criteria. 

By providing such computer readable media, the atomic coordinate data can be routinely accessed to 
model 30S or a sub-domain thereof For example, RASMOL is a pubhcly available computer 
software package which allows access and analysis of atomic coordinate data for structure 
determination and/or rational drug design. 

On the other hand, structure factor data, which are derivable from atomic coordinate data (see e.g. 
Blundell et al, in Protein Crystallography, Academic Press, New York, London and San Francisco, 
(1976)), are particularly useful for calculating e.g. difference Fourier electron density maps. 

In another aspect, the present invention provides systems, particularly a computer systems, intended 



20 



to generate structures and/or perform rational drug design for 30S ligand complexes, the systems 
containing either (a) atomic coordinate data according to Table n, said data defining the three- 
dimensional structure of 30S or at least one sub-domain thereof, or (b) structure factor data for SOS, 
said structure factor data being derivable from the atomic coordinate data of Table n. 

5 

Mutant strains resistant to the action of these antibiotics can arise through mutation of a protein 
subunit of the 30S or through mutation or modification in the 16S RNA (e.g. 2'0-methylation), or 
modification (e.g. acetylation) of the antibiotic). The sites of mutations in some cases are known or 
can be identified. Where such sites are identified through, for example, primary sequence data, the 
1 0 invention provides a means to model the structure of the mutants. 

□ There is thus provided a method which comprises providing the structure of the SOS ribosome of 
J Table n, changing one amino acid or nucleotide of said structure to provide a mutant SOS, and 

modeling the structure of the mutant SOS to provide a structure of the mutant. The mutant may be 

y i 

111 used in the manner described above for the wild type, e.g. stored in computer readable form, 
Plj modeled to provide ligands, and the like. The modeling may be based upon the predicted behavior 
of the atoms of the changed amino acid based upon its interaction with the surrounding atoms in the 
M model provided herein. 

2;B This process may be iterative, e.g. to produce successive mutations into the SOS structure, for 
example 2, S, 4, or 5 to 10 mutations or more. 

Regions of SOS which may be subject to this aspect of the invention include but are not limited to 
those regions identified in the accompanying examples as regions of the SOS involved in binding to 
25 antibiotics. 

In a fiirther aspect, the present invention provides a means to solve or interpret electron density maps 
of the whole 70S ribosome at low or high resolution, and thus solve the structure of the whole 70S 
ribosome. 

30 

In particular, the invention provides a method for the determination of the structure of a bacterial 
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70S ribosome which method comprises 

(a) crystalHsing the 70S of said species to obtain a crystal; 

(b) performing X-ray crystallography on said crystal to obtain X-ray diffraction data; 

(c) providing the structure data of Table 1 ; and 

(d) using molecular replacement to calculate an electron density map of the 70S. 

The crystaUisation step (a) is optionally perfomied with an antibiotic Z, either in a co-crystallisation 
or by soaking the antibiotic following crystal formation. Thus the calculated electron density map 
may be that of the SOS - antibiotic complex. 

The invention is illustrated, but not limited, below by the following examples, their accompanying 
Figures and Tables. In Table n there is shown in each row Atom number, element type, residue 
(amino acid, nucleotide, etc), number in molecule (for proteins N to C terminal direction, for nucleic 
acid 5* to 3' direction), X, Y and Z co-ordinates, occupancy, B factor (A^) and an identifier for the 
member of the SOS. 

Throughout the accompanying example, we use the numbering system for coli 16S RNA, as well 
as the standard heUx numbering, denoted H1-H45, for the secondary structure elements [19] with 
some modifications as shown in Figure 1. The most significant differences between the E, coli and 
r. thermophilus sequences are a shorter H6 and HIO, and insertions in H9 and HSSa. Any insertions 
in T. thermophilus relative to E. coli are indicated in the coordinates with an insertion letter after the 
nucleotide number, following the practice for tRNA. 

EXAMPLES 

Example 1 - Crystalliz ation of Paromomycin. Spectinomvcin and Streptomycin to the SOS 
Ribosome. 

Crystallization of the 308, 

Because we observed that the SOS crystals completely lacked ribosomal protein SI, care was taken 
to remove SI selectively from the SOS prior to crystaUization. Crystals were obtained in lS-17% 



MPD over a range of pH in the salt and magnesium conditions described by Trakhanov et al [3], 
The crystals were largest and most reproducibly obtained at a pH of 6.5 in 0. 1 M cacodylate or MES 
buffer. Crystals took approximately 6 weeks at 4'C to grow to their maximum size. The largest 
crystals, which were required for high resolution data collection, grew to a size of 80-100 x 80-100 x 
5 200-300 microns. The activity of redissolved crystals in poly(U)-directed protein synthesis was 
comparable to that of freshly isolated 30S subunits. 

Data collection. 

Crystals were transferred to 26% MPD by vapor diffusion in two steps over a period of 6 days. All 
1 0 crystals (except for those soaked in osmium hexammine or osmixmi pentammine) also contained 1 

mM cobalt hexammine in the cryoprotectant. Crystals were flash-cooled by plunging into Uquid 
□ nitrogen, and data collection was done in a cryostream at 90-100 K. 

I^ii A large fi-action of crystals was screened at beamUnes 9.6 or 14.1 at the SRS at Daresbury 
IB Laboratories, using two short exposures at least 40 degrees apart. These crystals were then analyzed 

for diffraction limits, cell dimensions and mosaic spread. Only crystals of similar cell dimensions 

and with reasonable mosaic spread were used for data collection. 

Potential derivatives were screened on beamhnes X25 at the NSLS at Brookhaven National 
23 Laboratory and BM-14 at the ESRF (Grenoble). Data to about 4.5A were obtained from X25. High 
resolution data were collected at SBC ID-19 at the APS in Argonne National Laboratory, and ID14-4 
at the ESRF. In all cases, derivative data were collected at the peak of the fluorescence at the LIU 
edge to maximize anomalous differences. At X25 and SBC ID-19, the kappa goniostat was used to 
rotate precisely about a mirror plane so that small anomalous differences could be measured 

2 5 accurately. Each crystal typically yielded 3-10 degrees of data. Data were integrated and scaled 

using HKL-2000 [10]. 

Structure determination. 

Previously determined phases at 5.5 A [9] were used to locate heavy atom sites using anomalous 

3 0 difference Fourier maps. Initially, these sites were used for phasing to 3.35 A using the program 

SOLVE [11], followed by density modification with SOLOMON [12], using the procedure 
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implemented in SHARP [13], Optimization of the various parameters in the procedure was required 
to obtain interpretable maps. The RNA and some of the proteins were built using the SOLVE maps. 
The sequence of Thermus thermophilus 16S RNA [14] was used for the structure. For proteins, a 
combination of previously published sequences and new ones from the Gottingen Thermus genome 
5 sequencing project were used. Improved maps were obtained by calculating experimental phases to 
3.2A using SHARP followed by density modification and phase extension to 3.05A with DM [15]. 
The improved maps allowed us to build all the ordered parts of the structure. The model was built 
using O [16], and refined using the program CNS [17], Maximum Hkelihood refinement was used, 
initially with both amplitudes and experimental phase probabihty distributions to 3. 35 A, and 
1 0 subsequently with amplitudes to 3 .05 A. 

□ Results 

.'f; The 30S subunit fi-om Thermus thermophilus consists of a 1522 nucleotide 16S ribosomal RNA [14] 

M and 21 associated proteins, of which 20 have known coxmterparts in E. coU. Protein S21 is not 

TS present in Themius, and protein SI has been removed from the 3 OS prior to our crystallization. In 

il^J addition, a 26 residue peptide, Thx, is present in Thermus 30S subunits [18]. 

y Experimentally phased maps clearly showed main chain density for RNA and protein, individual 
bases (which were often of sufficient quality to distinguish purines from pyrimidines), and large 

M well-ordered side chains of proteins. These maps were used to build 16S RNA and the previously 
unknown proteins S2, S3, S9, SIO, SI 1, S12, S13, S14 and Thx. In addition, regions that were 
disordered in isolated structures or had changed significantly were also built. This often consisted of 
significant portions of the N- and C-terminal tails of the proteins, sometimes including entire 
domains that were unfolded in isolation. Proteins with small cores and long loops, such as S16 and 

25 SI 7, had to be substantially rebuilt, since these loops were generally disordered in the solution NMR 
structures. Finally, the entire structure was rebuilt after an initial round of refinement. Our current 
model consists of nucleotides 5-1511 of Thermus thermophilus 16S RNA (corresponding to 5-1534 
of E. coli 16S RNA) and all of the ordered regions of the associated 20 proteins. The current model 
has been refined against 3.05A data with a conventional R-factor of 0.213, a free R-factor of 0.256 

3 0 and good geometry. For the proteins, 94% of the residues were in the core or allowed regions of the 
Ramachandran plot, 3.9% in the generously allowed region and 1.8% in the disallowed region. 
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Paromomycin, 

Paromomycin is a member of the aminoglycoside family of antibiotics which increases the error rate 
of the ribosome. This family is thought to reduce the dissociation rate of A-site tRNA from the 
5 ribosome. Recent experiments suggest that it affects both initial selection and proofreading. Crystals 
were obtained and the data collected is provided in Table 1. Figure 1 shows the interactions between 
16S RNA residues and paromomycin. 

Paromomycin binds in the major groove of H44 in a location that is in agreement with mutagenesis 
10 and protection data. It is also in general agreement with an NMR structure of paromomycin bound to 

an RNA fragment corresponding to its binding site [20]. Contacts are shown in Figure 1. Ring IV 
i:3 contacts the backbone of both sides of helix in an orientation that differs from the NMR structure, 

while ring III makes only weak contacts with the RNA. Ring II forms tight interactions with both 

bases and backbone of the RNA, while Ring I inserts into the RNA helix and helps flip out bases 
IB Al 1 92 and A1493 when compared to the structure in the absence of paromomycin. Ring I mimics a 
Plj nucleotide base, stacking against G1491 and hydrogen-bonding with A1408. In addition it forms a 

tight H bond interaction with the phosphate backbone of A1493 which helps lock the flipped out 
y bases in place. Except as noted, many of the interactions are similar to those reported in the NMR 
; rj structure [20] although the bases are flipped out to a far greater degree, and consequently, we do not 
is see a base pair between A1408 and A1493. 

We have modelled the codon and anticodon in the A site using a superposition of the 7.8 A structure 
of the 70S ribosome with tRNA and mRNA bound. The flipped out bases point directly into the A 
site and are positioned to interact with the minor groove of the hehx formed by the codon-anticodon 

2 5 interaction. It is probable that the A-site codon-anticodon helix must undergo some degree of 

rotation during or after GTP hydrolysis by EF-Tu, in a conformational change to a proofreading state 
of the decoding site. However, there are rather strict covalent and steric constraints on the A-site 
anticodon and especially the codon, which is covalently attached to both the P-site codon and 
downstream message. Thus, despite some rotational uncertainty in the orientation of the codon- 

3 0 anticodon hehx, it appears unlikely that the 1492-1493 bases could interact with any portion of the 

codon-anticodon helix other than its minor groove, though interactions with other portions of the 
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A-site tRNA anticodon loop are not ruled out. This model provides clues as to how paromomycin 
increases the affinity of the A site for tRNA. It seems likely that in the absence of paromomycin 
some energy is required to flip out A1492-A1493 so they can contact the tRNA, and presumably this 
energetic cost is compensated by the formation of favourable interactions with tRNA. By binding to 
5 H44, paromomycin forms a structure in which these bases are already flipped out, thus reducing the 
energetic cost of both cognate and non-cognate tRNA binding and increasing tRNA affinity for the 
A site. 

This structure is in general agreement with a previously proposed model in which A1492 and 
10 A1493 would make contact with the minor groove of the mRNA-tRNA duplex [21]. hi that model, 

it was suggested that these bases hydrogen bond with the 2 -OH of the message. Given the rotational 
p uncertainty of the positioning of the A-site tRNA m our model, we cannot determine that these 
^5 adenines hydrogen-bond to the message. However, two scenarios appear possible, within the limits 
|3 of the model. The adenines may hydrogen-bond to 2' OH groups of only the tRNA anticodon 
]f| stem-loop, or they may hydrogen-bond to 2'OH groups of both the tRNA and the message. Both 
I ■!! possibihties are attractive because they offer a direct explanation for increased affinity (and lower 

dissociation rate) of tRNA in the presence of the antibiotic. Finally, the degree to which the bases are 
;j flipped out in our structure allows a possible reconciUation of the proposal that the mRNA binds in 

the major groove of H44 23 with the notion that A1492 and A1493 inspect the minor groove of the 
2^3! codon-anticodon interaction. 

Rings I and II of paromomycin are found in a number of other antibiotics including gentamycin. An 
NMR structure of gentamycin bound to the same firagment of H44 showed that these two rings 
interact with RNA in the same way as in paromomycin [20]. This suggests that other 

2 5 aminoglycosides that bind to the decoding centre on H44 induce errors in translation by the same 

mechanism as paromomycin. 

Spectinomycin 

In contrast to the flexible aminoglycosides, the fused ring system in spectinomycin makes it a rigid 

3 0 molecule. It binds in the minor groove at one end of H34. It makes a single contact with a backbone 

phosphate and makes hydrogen bonds to a number of bases (Figure 2). The most interactions are 
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made with G1064 and CI 192, consistent with protection studies [22] and mutagenesis data which 
showed that any combination of substitutions at these bases gave resistance to spectinomycin [23]. 
These two bases are held too far apart to form Watson-Crick base pairs, but are able to make a single 
hydrogen bond. 

5 

A loop of S5 and part of H28 of 16S RNA approach the spectinomycin binding site, but in this state 
do not make direct contacts with it. It is possible, however, that in other conformations of the 308, 
spectinomycin is in more direct contact with these regions. 

10 A superposition of the A, P and E site tRNAs from the 70S ribosome onto our 30S structure shows 

that a number of highly specific contacts from the head stabilize these tRNAs. A movement of tRNA 
P from one site to the other must necessarily involve movement of elements of the head. Such 

movements would involve H34 and a possible rearrangement of the connections between it and 
O helices 35 and 38. The structure suggests that the rigid spectinomycin molecule binds near this pivot 
i| point of the head and sterically blocks movement although it is also possible that it acts to stabilize 
l"^ the upper stem of H34 [23]. As mentioned above, mutations in S5 that cause resistance to 
=5 spectinomycin [24] do not make direct contacts with the antibiotic. Rather, they map to a loop that 
ly stabihzes the interaction between HI and the H35-H36 region which is directly connected to H34. 
P^; An attractive hypothesis is that the mxxtants destabhze this interaction, and by thus removing the 

network of interactions that stabihzes the conformation of the head to the body via S5, allows it to 

move even when spectinomycin is bound. 

Streptomycin 

Unlike paromomycin, which can bind to an isolated fragment of ribosomal RNA, streptomycin is 
2 5 tightly bound to the phosphate backbone of 16S RNA from 4 different parts of the molecule via both 
salt links and hydrogen bonds (Figure 3). It also makes contact with a lysine (K45) from ribosomal 
protein S12. The four regions of 16S RNA (1490, 915, 526, and 13) had all been implicated in 
streptomycin binding on the basis of protection [22], crosslinking [25] and mutagenesis data [26-28] 
(reviewed in [29]). 

30 

It has been proposed that translational fidelity involves a switch between two states of the ribosome, 
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an error prone or ram (885) state, characterized by nucleotides 910-912 pairing with 885-887, and a 
restrictive or hyperaccurate (888) state in which 910-912 pair with 888-890 [30], 

The structure of the 308 reported here, like that of the 70S ribosome at 7.8 A [31] is in the 885 
5 pairing configuration, and hence presumably in the ram state. The tight interactions described above 
suggest that streptomycin preferentially stabihzes this form. The 888 state A site has a low tRNA 
affinity, while the 885 state has a higher affinity [30, 32]. Therefore by stabihzing the 885 state 
streptomycin would be expected to increase initial binding of non-cognate tRNAs, The preferential 
stabihzation of the 885 state would also make the transition to the 888 state more difficult, thereby 
1 0 also affecting proofi-eading. Thus our results offer a structural rationale for the observed properties 
of streptomycin. 

This stabihzation of the 885 state by streptomycin suggested by our structure can explain much of 
P the genetic data on the antibiotic. Mutations in S12 lead to a hyper-accurate phenotype [32- 
3jl 38](reviewed by [39]) . A weak phenotype manifests itself as streptomycin resistance, where as a 

strong phenotype (often the result of multiple mutations) leads to streptomycin dependence. Most of 
^ these mutants are to varying degrees more hyperaccurate and slower than wild type ribosomes, 
Q consistent with destabilization of the 885 state with respect to the 888 state. 

21 All the mutations in S12 map to the a protein loops that connect and hold in place the 908-915 and 
524-527 regions, with the exception of one mutant K56 (E.coli K53) which contacts H44 (Figure 3). 
Thus S12 stabilizes the same region that is stabihzed by streptomycin. In the resistance mutations, 
the 885 state is destabilized sufficiently so that the additional stabilization mduced by streptomycin 
does not trap the ribosome in this state. In the streptomycin dependent mutants, the 885 state is so 

2 5 destabilized that the 888 (hyperaccurate) form predominates and protein synthesis becomes very 

slow. Streptomycin can then help stabilize the 885 state sufficiently to restore the balance between 
the two states and help restore translation. 

The K45R (E.coli K42) mutation is resistant to streptomycm but is not hyperaccurate [39]. K45 

3 0 forms a salt bridge with phosphate A913 and thus contributes to stabilization of the 885 state. It also 

makes direct hydrogen bonding contacts to two OH groups on streptomycin (Figure 3). Mutation of 
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this lysine to arginine, would disrupt the hydrogen bonding and thereby reduce the affinity of the 
3 OS for streptomycin, leading to resistance. However, the mutation would leave the salt bridge 
intact, so that the 885 form is not destabilized and thus translation remains normal. 

5 A number of mutations in rRNA also lead to hyperaccuracy [26, 27, 40-43]. Some of these 
nucleotides are involved in hydrogen bonding interactions in regions close to the streptomycin 
binding site. Thus the mutations disrupt interactions that help to stabihze the 885 state. Others such 
as A915 make no contacts with any other bases. It is possible that mutation of this base leads to 
more favourable contacts in the 888 state, thus acting by stabilizing the 888 state rather than 
10 destabilizing the 885 state. 

cj The ram mutations lead to error-prone ribosomes and are generally found as suppressors of 

J streptomycin resistance. These mutations in S4 [35, 44, 45] and S5 [12] can counter the effect of 

hyperaccurate mutations in S12 (reviewed by [39]). All ram mutants in S4 and S5 map near the 
nil interface between the two proteins with the exception of S52 (E.coli S49) which makes a direct 
j y hydrogen bond to the backbone of rRNA. At lower resolution [9], it appeared that the ram mutations 
destablized the S4-S5 interface. However, at atomic resolution, we see that two of the mutations V56 
\| (E.coU V53) in S4 and G99 (E.coli G103) in S5 are not in contact with the other protein, and it is not 

obvious from our structure how they would affect stability of the ram state. In fact, there is a cleft 
M between the two proteins which could close up on the 885-888 transition. This leads us to suggest 
that these residues (and perhaps the corresponding surfaces of S4 and S5) are involved in intimate 
contacts in the 888 state, and these contacts would be disrupted by the ram mutations. In this model, 
ram mutations act by destabilizing the 888 state, and thus shifting the equilibrium to the error prone 
885 state. The observation that ram mutations increase the affinity of ribosomes for streptomycin 
2 5 [46] is consistent with this model, since the ribosome would be preferentially in the 885 form. A 

definitive test of this model must await an atomic resolution structure of the 888 form. Nevertheless, 
our results provide a structural basis for the notion that a delicate balance exists between the 885 and 
888 states for optimal translation, and also explains how disruption of this balance leads to the 
various phenotypes observed. 

30 

Example 2 - Crystallization of Tetracycline to the 30S Ribosome. 
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Tetracyclines are antibiotics of broad specificity and have been used since the 1940's against a wide 
range of both Gram-negative and Gram-positive bacteria [47]. These drugs were the first so-called 
*broad-spectrum' antibiotics and have been used extensively in both human and veterinary medicine. 
However, in later years, the widespread use of tetracyclines has been limited by the emergence of 
5 significant microbial resistance to these antibiotics. Tetracyclines bind to the SOS ribosomal subunit 
[48] where it affects exclusively the binding of aminoacylated tRNA to the A-site [49]. There is no 
effect on the binding of tRNA to the P-site nor on the fidelity of translation [50]. Consistent with the 
inhibition of tRNA binding to the A-site during translation, tetracycline also prevents binding of 
both release factors RF-1 and 2 during termination, regardless of the stop codon [51]. In contrast to 
1 0 most antibiotics, resistance to tetracycline is usually not caused by mutations in 16S RNA or 

ribosomal proteins, but by the presence of several external protein factors which apparently mimic 
Q the structure and function of the elongation factors [52-54]. 

1'^ The crystal structure of tetracycline, or 4-(dimethylamino)-l,4,4a,5,5a,6,l l,12a-octahydro- 
tU 3,6, 1 0, 1 2, 1 2a-pentahydroxy-6-methyl- 1,11 -dioxo-2-naphtacenecarboxamide, in complex with the 
py 308 ribosomal subunit was determined at 3. OA resolution. 30S crystals were prepared as described in 

example 1 above and soaked post crystaUisation in SO^iM tetracycline. X-ray diffi-action data were 
^^■^1 collected at beamline ID14-4 at the European Synchrotron Radiation Facihty in Grenoble, France, 
i , f The location of the antibiotic within the 30S subunit was identified fi-om difference Fourier maps 

calculated after a few rounds of maximum-likelihood based refinement of the native 30S structure 

against the measured structure factor amplitudes. 

We have found two strong binding sites for tetracycUne within the 308 subunit, one located near the 
acceptor site for aminoacylated tRNA (the A-site) between the head and the body and another which 

2 5 is present at the interface between three domains in the body of the subunit. The discovery of more 

binding sites is not surprising, since tetracycline is known to have one primary binding site on the 
308 in addition to multiple secondary sites on both subunits [55]. In its primary binding site within 
the 308, tetracycline binds exclusively to the 3' major domain in the upper part of the crevice 
between the head of the 308 and the shoulder and right above the small gap between the stem-loop 

3 0 of H18 of the 5' domain and the long H44 of the 3' minor domain that constitutes the binding site for 

aminoacylated tRNA. The molecule fits into a small pocket created by residues in H34 that deviate 
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from the canonical A- form RNA double helical confomiation in combination with a part of the small 
H31 stem-loop structure. The contacts to H31 are quite tenuous, however, and the binding of the 
antibiotic to 16S RNA seems to depend ahnost exclusively on the interaction with H34. In contrast 
to what has been proposed earlier, there are no proteins involved in the primary binding of 
5 tetracycline. The second binding site of tetracycline that we observe clearly in the difference maps 
(although perhaps not as clearly as the primary binding site), is located in the body of the subunit, in 
close proximity to the penultimate H44 and sandwiched between the functionally important H27 in 
the central domain and the very top of HI 1 in the 5' domain of 16S RNA. The binding site it 
confined on one side by a major groove of H27 (residues 891-894:908-91 1) and the edge of the 
1 0 curved HI 1 (residues 242-245). The bulged-out base U244, which reaches across and makes a non 

Watson-Crick bases pair with C893 in H27 forms the bottom of the binding site. Again, all 
P interaction between the antibiotic and the ribosome is mediated by the RNA, however, the long N- 
-!! terminal extension of S12 comes very close to the tetracycline (-- 8 A, Argl9). The binding pocket is 
□ approximately 14A wide and 7A deep. 
M 

Example 3 - Crvstallization of Pactamvcin to the 3 OS Ribosome. 

^1 Pactamycin was isolated from S. pactum as a potential new hxmian antitumor antibiotic and is a 

potent inhibitor of translation in both eukaryotes and prokaryotes [56]. It is believed to inhibit the 
2;Q initiation process, i.e. the initiation factor mediated binding of fMet-tRNA to the ribosome [57], by 
sequestering the initiation complex in the A-site [58]. This effectively prevents the formation of 
entire 70S ribosomes and thus halts the translation. The effect upon binding is similar for 
prokaryotes and eukaryotes [59], 

2 5 The crystal structure of pactamycin, or 2-hydroxy-6-methylbenzoic acid [5-[(3-acetylphenyl)amino]- 

4-amino-3-[[(dimethylamino)-carbonyl]amino]-l,2-dihydroxy-3-(l-hydroxyethyl)-2- 
methylcyclopentyl]methyl ester, in complex with the 30S ribosomal subunit was determined at 3.lA 
resolution. 30S crystals were prepared as described as above and soaked post crystallisation in 80|iiM 
pactamycin. X-ray diffraction data were collected at beamline ID 14-4 at the European Synchrotron 

3 0 Radiation Facility in Grenoble, France. The location of the antibiotic within the 30S subunit was 

identified from difference Fourier maps calculated after a few rounds of maximum-likelihood based 
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refinement of the native 308 structure against the measured structure factor amplitudes. 

Pactamycin binds to the upper part of the platform, very close to the cleft in the subunit that is 
responsible for binding of the three tRNA molecules. We have only found a single strong binding 
site of pactamycin in the 308. The antibiotic interacts primarily with residues at the apices of the 
H23b stem loop in the central domain of 168 RNA in addition to a couple of bases from the nearby 
H24a (Figure 5). The site of binding is very close to the 3' minor domain and the ultimate H45, but 
there is not direct interaction with this region. In its binding to the RNA, pactamycin extends the 
stacking of bases in the tetra loop of H23b and mimics RNA both with respect to the bases and the 
sugar phosphate backbone. In this region, The H24a loop forms a regular helical stem loop to which 
the H23b stem loop is attached with interactions mainly between bulging bases in H23b and the 
backbone of H24a. The bases near the apices of H23b curve around and pack into the major groove 
of H24b, and this trend is extended by two "bases" by a single pactamycin molecule. The antibiotic 
folds up so that the two distal 6-carbon rings are stacked against each other like nucleotides with the 
5-carbon in between resembling a sugar ring. The NCON(CH3)2 extension on the central ring even to 
some extent mimics the phosphate ester moiety of RNA. The nearest proteins are 87 and 811, and 
there appears to be a weak hydrogen bond to the backbone carbonyl of GlySl of 87. 

In Figure 5 we have indicated an interaction with a residue "U1450mRNA". From our data this 
residue is not continuous with the main 168 RNA sequence, but appears as part of a stretch of 
mRNA which appears in the crystal. However, it is beUeved that it represents the very end (3' end) 
of 168 RNA, based on sequence, thus it has been given sequence numbers from the 168 in this 
figure (1539-1544). However, in the table of coordinates, these residues have been separately 
numbered 1-6, the coordinates being shown immediately following those of the 168 RNA. 

Even though pactamycin has been described as binding primarily to the ribosomal P-site [57, 60], 
the observed protections for this antibiotic can be regarded as pertaining rather to the E-site. This 
notion is in more agreement with the present structure, in which the two "bases" of pactamycin 
coincide with the two last bases of the E-site codon of mRNA as observed in the native 308 structure 
[61]. In fact, pactamycin together with the first base in the E-site codon of the native structure form a 
triplet codon mimic in approximately the right position for interaction with an E-site tRNA. 
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However, in the antibiotic bound structure, the actual position of the bases in the E-site codon is 
shifted remarkably, in a way that precludes a possible interaction with an E-site bound tRNA. In the 
native 30S structure, a kink is observed in the backbone of the messenger RNA at the interface 
between the P- and E-sites, however, the overall path of the mRNA is still relatively straight and 
5 leads between the long 73-90 beta hairpin of S7 and the stem loops of H23 and H24a of the platform. 
In the pactamycin-bound structure, however, the mRNA in the E-site is pushed towards the back of 
the subunit, and in between H28 of the head and the hairpin of S7. This is a remarkable distortion 
that comprises on average 12.5A for the last of the bases in the E-site codon. 

10 Example 4 - Crvstallization of Hvgromvcin B to the 30S Ribosome. 

Hygromycin B is a monosubstituted 2-deoxystreptamine-containing aminoglycoside antibiotic 
{;!} originally isolated as a secondary antibiotic substance from S. hygroscopicus [62], It is an unusual 
^ aminoglycoside antibiotic in that it is active against both prokaryotic and eukaryotic cells and differs 

in structure from other aminoglycosides [63]. The drug works primarily by inhibiting the 

11 translocation step of elongation [63-65] and to a lesser extent causes misreading of messenger RNA 
[50, 65]. The antibiotic affects EF-2 (EF-G) mediated translocation of A-site bound tRNA to the P- 

1; „ site in eukaryotes, but does not affect either binding of the factor to the ribosome or the hydrolysis of 
\i the bound GTP, a process that has been shown to be separate fi-om translocation [63]. The mhibition 
; ~\ of translocation is accompanied by an increase in the affinity of the A-site for aminoacylated tRNA 
[50]. 

The crystal structure of hygromycin B, or 0-6-amino-6-deoxy-L-glycero-D-galacto- 
heptopyranosyUdene-(1^2-3)-0-P-D-talopyranosyl-(1^5)-2-deoxy-N^-methyl-D-streptamine, in 
complex with the 30S ribosomal subunit was determined at 3.1 A resolution. 30S crystals were 
2 5 prepared as described above, and soaked post crystallisation in SOpM hygromycin. X-ray diffraction 
data were collected at beamline ID 14-4 at the European Synchrotron Radiation Facility in Grenoble, 
France. The location of the antibiotic within the 30S subunit was identified from difference Fourier 
maps calculated after a few rounds of maximum-likelihood based refinement of the native 30S 
structure against the measured structure factor amplitudes. 

30 

Hygromycin B has a single clear binding site within the 30S consistent with the finding that it has a 
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monophasic effect on translation [66]. It binds close to the very top of the long, penultimate helix 44 
of 16S RNA, in a region that contains the and E-site tRNA binding sites. The antibiotic is in 

contact only with 16S RNA (not any proteins), and only with helix 44. In fact, it is located in the 
major groove of the helix, very close to the helical axis, and thus surrounded by residues from both 
RNA strands in the region 1490-1500 and 1400-1410. Hygromycin B almost exclusively contacts 
the bases, as opposed to the backbone, of RNA, and would on this basis be expected to be highly 
sequence-specific. The nearest protein is SI 2, which is known to be important in decoding, but is 
more than 14A away from the hygromycin binding site. Binding of hygromycin B does not seem to 
induce any significant alterations in the structure of RNA, and appears to be govemed by strong 
base-specific hydrogen-bonds spanning more than three sequential bases in one strand of helix 44. 
This is possible because the structure of the three-ring antibiotic is unfolded in its binding site within 
the 308 and thus makes the molecule about 13A long. 

Hygromycin B binds to the BOS in an important fimctional region which is also the target for other 
antibiotics such as paromomycin and gentamycin. Both these antibiotics bind fiirther down helix 44 
than does hygromycin and thus affect adenosines A1492 and A1493 which have been implicated as 
crucial in decoding. Interestingly, Ring II of paromomycin, which is also found in other 
aminoglycoside antibiotics including gentamycin, adopts an almost identical orientation as Ring I of 
hygromycin, only about 3 A further down the helix, or exactly what corresponds to one residue. This 
indicates that this type of six-ring is an important general means of antibiotic binding, since 
aboUshment of the interaction with RNA (in the case of hygromycin) leads to resistance [67, 68]. 

Although the foregoing invention has been described in some detail by way of illustration 
and example for purposes of clarity of understanding, it will be apparent to those skilled in the art 
that certain changes and modifications may be practiced. Therefore, the descriptions and examples 
should not be construed as hmiting the scope of the invention, which is delineated by the appended 
claims. 
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